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ABSTRACT 

We use a sample of 37 of the densest clusters and protoclusters across 1.3 ^ z < 3.2 
from the Clusters Around Radio-Loud AGN (CARLA) survey to study the formation 
of massive cluster galaxies. We use optical I'-band and infrared 3.6/im and 4.5 fim im¬ 
ages to statistically select sources within these protoclusters and measure their median 
observed colours; {i' — [3.6]). We find the abundance of massive galaxies within the 
protoclusters increases with decreasing redshift, suggesting these objects may form an 
evolutionary sequence, with the lower redshift clusters in the sample having similar 
properties to the descendants of the high redshift protoclusters. We find that the pro¬ 
tocluster galaxies have an approximately unevolving observed-frame i' — [3.6] colour 
across the examined redshift range. We compare the evolution of the {i' — [3.6]) colour 
of massive cluster galaxies with simplistic galaxy formation models. Taking the full 
cluster population into account, we show that the formation of stars within the ma¬ 
jority of massive cluster galaxies occurs over at least 2Gyr, and peaks at z ~ 2-3. 
From the median i' — [3.6] colours we cannot determine the star formation histories 
of individual galaxies, but their star formation must have been rapidly terminated to 
produce the observed red colours. Finally, we show that massive galaxies at z > 2 
must have assembled within 0.5 Gyr of them forming a significant fraction of their 
stars. This means that few massive galaxies in z > 2 protoclusters could have formed 
via dry mergers. 

Key words: galaxies: clusters: general ; galaxies: high-redshift ; galaxies: evolution 
; galaxies: formation 


1 INTRODUCTION 

In the local Universe, most massive cluster galaxies are old 
and have little-to-no ongoing star formation. They form 
a very homogenous, slowly-evolving population, exhibit- 
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ing similar, red colours. When viewed in colour-magnitude 
space, these massive, old galaxies form a characteristic “red 
sequence”. Such red sequences of galaxies are nearly ubiqui¬ 
tous in low redshift clusters, and persist out to z ~ 1.5 (e.g. 


Blakeslee et al.||2003| [Holden et al.||2004| |Mei et ar]|2006| 
Eisenhardt et al.|2008 1. Red sequences have commonly been 


used to examine the formation history of massive cluster 
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galaxies. The colour, slope and low scatter of the red se¬ 
quence within clusters are consistent with early-type clus¬ 
ter galaxies forming concurrently in a short burst of star 
formation at high redshift {z > 2) and passively evolving 
thereafter (e.g. Bower, Lucey fc Ellis|1992 Eisenhardt et al. 


2007), although there are indications that further star for¬ 


mation occurs within galaxies towards the outskirts of the 
cluster ( Ferre-Mateu et al.||2014 |. 

The red sequences of low redshift clusters indicate a 
high formation redshift, though it is difficult to determine 
the exact epoch and history of galaxy formation using their 
galaxy colours. The colours of galaxies that have been pas¬ 
sively evolving for more than a few billion years are very 
similar ( Kauffmann et ar]|2003 ). Thus it is difficult to dif¬ 
ferentiate between formation redshifts if the time between 
the galaxy’s formation and observation is several Gyr. Ob¬ 
serving galaxies closer to their formation period makes it 
possible to measure galaxy ages more accurately. By mea¬ 
suring the colours of galaxies within high redshift clusters 
we can determine the exact epoch and history of formation 
and break the degeneracies between single collapse models 
and those which include extended periods of galaxy growth 
(e.g. Snyder et al.||2012 |. 

The scatter in colour of early-type cluster galaxies at 
2 > 1 is low and consistent with passive evolution for z < 2.3 


(e.g. Stanford, Eisenhardt k, Dickinson|1998|[Blakeslee et al.| 


2003 Mei et al.||2006 Lidman et al.||2008 ). However, when 

examining the full cluster population, it is no longer possible 
to model the galaxy formation history with a single forma¬ 
tion timescale, but rather there is a scatter in the inferred 
ages (e.g. Eisenhardt et al.|2008 Kurk et al.|2009 1. Clusters 
at 2 > 1 were much more active than they are today; they 


exhibited significant ongoing star formation (Snyder et al. 


20^ [Brodwin et al.|2013||Zeimann et al.|2013[ Alberts et al. 


et al. 


2010 


2014), merging between galaxies (Mancone et al. |2010| |Lotz 


2013 ) and increased AGN activity ( Galametz et al. 
Martini et al.|20T3). The red sequence of clusters and 


protoclusters was much less populated at z > 1 than today 


(e.g. Kodama et al.pOOT Hatch et al.||20lT Rudnick et al. 


2012), which means some of the progenitors of local red se¬ 


quence galaxies would have bluer colours and lie below the 
red sequence at z > 1. Thus, when tracing the evolution of 
just the galaxies that already lie on the red sequence, high 
redshift studies are prone to progenitor bias (van Dokkum 


& Franx 2001). To robustly trace the evolution of cluster 


galaxies it is important to study all the progenitors; those 
that are already passive at high redshift, and those that only 
become passive at a later time. 

To trace the early formation history of massive clus¬ 
ter galaxies, we have undertaken a large survey of clusters, 
pushing the study of galaxy colours to even higher redshifts. 
Our sample extends from z = 1.3 out to z = 3.2, covering 
the full timescale of massive cluster galaxy formation mea¬ 
sured from previous works (e.g. Blakeslee et al.|[2003 ). We 
have taken our cluster sample from the Clusters Around 


Radio-Loud AGN (CARLA; Wylezalek et al. 2013) sur¬ 


vey. CARLA is a 400-hour Warm Spitzer Space Telescope 
programme designed to locate and investigate clusters of 
galaxies at z > 1.3. Radio Loud Active Galactic Nuclei 
(RLAGN) preferentially reside in dense environments (e.g. 
[Hatch et al.||201^ and references therein), so the CARLA 
survey targeted the environment of 419 powerful RLAGN 


lying at 1.3 < z < 3.2 and having a 50 0 MHz luminosity 
^ lO^^.s wHz“^. Wylezalek et al. (2013) showed that 55% 


of these RLAGN are surrounded by significant excesses of 
galaxies with red [3.6] — [4.5] colours that are likely associ¬ 
ated with the RLAGN and therefore are likely to be high 
redshift clusters and protoclusters. 

The luminosity functions of the CARLA cluster galaxies 


were investigated in Wylezalek et al. (2014), showing that 


passive galaxy evolution models were consistent with their 
measured m* up to z ~ 3.2. In this paper we investigate the 
formation epoch and star formation history of the cluster 
galaxies. We observe 37 of the densest CARLA fields in the 
i! band and calculate the average observed i! — [3.6] colour 
of the cluster galaxies. We avoid progenitor bias by mea¬ 
suring the colours of all M* > 10'^°'® Mq cluster galaxies to 
estimate the overall average colour evolution of massive clus¬ 
ter galaxies. Although using average colours loses informa¬ 
tion about the individual cluster galaxies, if they are formed 
as concurrent bursts, then averaging will reduce photomet¬ 
ric errors. If the galaxy population is more diverse, then 
by modelling that diversity we can compare to the average 
colour, which still contains information about the popula¬ 
tion. The scatter in colours can also be used to reduce the 
effect of degeneracies between models. 

In Section we describe our data and control fields, as 
well as the methodology of measuring cluster galaxy prop¬ 
erties. Section [^presents our key results, the implications of 
which are discussed in Section (4] Our conclusions are sum¬ 
marised in Section We assume a ACDM cosmology with 
h = 0.70, Hm = 0.3 and Ha = 0.7 throughout, unless stated 
otherwise. Magnitudes are expressed in the AB system. 


2 METHOD 

In this Section we describe our methodology and datasets. 
We have two cluster samples: a high redshift (1.3 < z < 3.2) 
sample from the GARLA survey (Section |2.1| ), and a low 
and intermediate redshift sample (0.1 < z < 1.8) taken from 
the IRAG Shallow Gluster Survey (ISCS, described in Sec¬ 
tion 2.5). We also utilise a control field sample, from the 


UKIDSS Ultra Deep Survey (Section 2.2), in order to statis¬ 
tically subtract field contaminants from our cluster samples. 
Our selection of high redshift galaxies and cleaning of fore¬ 
ground interlopers is described in Section [2.3| Our method 
in calculating the colours of (proto) cluster galaxies is then 
described in Section [2.41 


2.1 High redshift cluster sample: CARLA 

2.1.1 Data 

Infrared (IR) data at 3.6 pm and 4.5 pm for the CARLA 
sample were taken during Cycles 7 and 8 using the Spitzer 
Infrared Array Camera (IRAG; Fazio et al.||2004 ). The im¬ 
aged fields are 5.2 x 5.2 arcmin^, with the 4.5 pm data being 
slightly deeper; 95% completeness is reached at magnitudes 
of ]3.6] = 22.6 and [4.5] = 22.9. For a full description of 
the IR observations and data reduction see [Wylezalek et al.j 

( M^ . 

We complement our existing Spitzer dataset with i' 
band imaging. The i' and 3.6 pm bands bracket the 4000 A 
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Field 

RA 

DEC 

z 

Instrument 

Exp. time (s) 

5cr depth 

Seeing (arcsec) 

Density“ 

J085442.00-I-57572 

08:54:42.00 

57:57:29.16 

1.317 

ACAM 

6600 

24.59 

1.00 

4.1 

J110020.21-I-09493 

11:00:20.16 

09:49:35.00 

1.321 

GMOS-S 

2405 

24.86 

0.65 

3.3 

J135817.60-I-57520 

13:58:17.52 

57:52:04.08 

1.373 

ACAM 

8400 

24.95 

0.89 

6.2 

MRC0955-288 

09:58:04.80 

-29:04:07.32 

1.400 

GMOS-S 

2645 

25.02 

0.58 

2.7 

7C1756-I-6520 

17:57:05.28 

65:19:51.60 

1.416 

ACAM 

10205 

24.26 

1.73 

4.4 

6CE1100-1-3505 

11:03:26.40 

34:49:48.00 

1.440 

ACAM 

9600 

25.19 

1.12 

6.3 

TXS 2353-003 

23:55:35.87 

-00:02:48.00 

1.490 

ACAM 

6000 

24.99 

0.81 

5.6 

J112914.10-I-09515 

11:29:14.16 

09:51:59.00 

1.519 

GMOS-S 

2645 

24.78 

0.44 

6.3 

MG0122+1923 

01:22:30.00 

19:23:38.40 

1.595 

ACAM 

7200 

24.83 

0.79 

3.2 

7C1753-I-6311 

17:53:35.28 

63:10:49.08 

1.600 

ACAM 

6000 

25.08 

0.74 

4.5 

BRL1422-297 

14:25:29.28 

-29:59:56.04 

1.632 

GMOS-S 

2689 

24.87 

0.62 

3.5 

J105231.82-I-08060 

10:52:31.92 

08:06:07.99 

1.643 

GMOS-S 

2645 

25.04 

0.56 

5.0 

TNJ0941-1628 

09:41:07.44 

-16:28:03.00 

1.644 

GMOS-S 

2645 

25.03 

0.58 

4.7 

6CE1141-I-3525 

11:43:51.12 

35:08:24.00 

1.780 

ACAM 

10187 

24.89 

1.59 

4.5 

6CE0905-I-3955 

09:08:16.80 

39:43:26.04 

1.883 

ACAM 

7200 

25.09 

0.89 

4.1 

MRC1217-276 

12:20:21.12 

-27:53:00.96 

1.899 

GMOS-S 

2645 

24.96 

0.69 

3.5 

J101827.85-I-05303 

10:18:27.84 

05:30:29.99 

1.938 

ACAM 

7200 

25.19 

0.81 

5.0 

J213638.58-I-00415 

21:36:38.64 

00:41:53.99 

1.941 

ACAM 

6300 

25.00 

0.69 

5.7 

J115043.87-00235 

11:50:43.92 

-00:23:54.00 

1.976 

GMOS-S 

2645 

25.04 

0.62 

2.7 

J080016.10-I-402955.6 

08:00:16.08 

40:29:56.40 

2.021 

ACAM 

6600 

25.16 

0.93 

6.3 

J132720.98-I-432627.9 

13:27:20.88 

43:26:27.96 

2.084 

ACAM 

7200 

25.03 

0.75 

4.1 

J115201.12-1-102322.8 

11:52:01.20 

10:23:22.92 

2.089 

ACAM 

7200 

25.18 

1.13 

4.2 

TNR 2254-1-1857 

22:54:53.76 

18:57:03.60 

2.154 

ACAM 

9000 

25.41 

1.10 

5.6 

J112338.14-I-052038.5 

11:23:38.16 

05:20:38.54 

2.181 

GMOS-S 

2645 

25.00 

0.64 

4.7 

J141906.82-I-055501.9 

14:19:06.72 

05:55:01.92 

2.293 

ACAM 

7800 

24.56 

1.28 

4.5 

J095033.62-I-274329.9 

09:50:33.60 

27:43:30.00 

2.356 

ACAM 

7200 

24.92 

1.31 

4.4 

4C 40.02 

00:30:49.00 

41:10:48.00 

2.428 

ACAM 

8700 

25.26 

0.94 

4.2 

J140445.88-013021.8 

14:04:45.84 

-01:30:21.88 

2.499 

GMOS-S* 

2640 

25.08 

0.38 

2.7 

J110344.53-I-023209.9 

11:03:44.64 

02:32:09.92 

2.514 

ACAM 

7800 

24.76 

1.33 

4.1 

TXS 1558-003 

16:01:17.28 

-00:28:46.00 

2.520 

GMOS-S 

2405 

24.94 

0.91 

2.4 

J102429.58-005255.4 

10:24:29.52 

-00:52:55.43 

2.555 

GMOS-S 

2645 

25.14 

0.53 

4.5 

J140653.84-I-343337.3 

14:06:53.76 

34:33:37.44 

2.566 

ACAM 

7800 

25.19 

0.73 

3.3 

6CSS0824-I-5344 

08:27:59.04 

53:34:14.88 

2.824 

ACAM 

7200 

24.65 

1.17 

4.1 

J140432.99-I-072846.9 

14:04:32.88 

07:28:46.96 

2.864 

ACAM 

8400 

25.06 

1.02 

4.1 

B2 1132-1-37 

11:35:06.00 

37:08:40.92 

2.880 

ACAM 

7200 

25.19 

0.98 

5.9 

MRC 0943-242 

09:45:32.88 

-24:28:50.16 

2.922 

GMOS-S 

2645 

24.83 

0.44 

4.7 

NVSS J095751-213321 

09:57:51.36 

-21:33:20.88 

3.126 

GMOS-S 

2645 

25.01 

0.61 

2.7 


Table 1. Details of i' band observations for the 37 CARLA fields observed with AC AM on the WHT and GMOS-S on Gemini South. 

“ Density is the number of standard deviations from the average field density of red IRAC sources, as calculated in |Wylezalek et al.| 
|[ 20 ^. 

*i' imaging of J140445.88—013021.8 was taken with the new Hamamatsu GCDs on GMOS-S, with reduced fringing effects. 


break at the redshifts covered by the CARLA survey, and 
allow direct comparison to previous work at lower redshifts 
(Eisenhardt et al. 20081. The extra i' band also allows us 
to refine the selection of cluster member galaxies, providing 
more detail in order to study the evolution of clusters more 
thoroughly than before. 


Optical i! band data were obtained for 37 of the densest 
CARLA fields with the auxiliary-port camera (ACAM) on 
the 4.2 m William Herschel Telescope (WHT) in La Palma 
and the Gemini Multi-Object Spectrograph South instru¬ 
ment (GMOS-S; |Hook et al. 20041 on Gemini-South in Chile. 
These fields were selected as the CARLA targets which 
contained the highest densities of sources with red IRAC 
[3.6] — [4.5] colours that were visible at the latitudes of each 
telescope. Densities of most of these fields are 4-6cr denser 
than an average blank field (see Table [^, as calculated in 


Wylezalek et al. (20141, with a few at 2-3 (j due to higher 


density fields not being observable. Fields with bright stars 
(m^/ Si 10) in the field of view were rejected to avoid satu¬ 
ration and bleeding in the images. Fields with known low- 


redshift clusters within the field of view were also rejected 
to avoid biasing our measurements of overdensities. 

Twenty-three fields were imaged with ACAM during the 
period 2013 September-2014 December. The field of view of 
ACAM is circular, with a diameter of 8.3arcmin and pixel 
scale 0.25 arcsec pixel” 

The remaining 14 helds were imaged with GMOS-S. 
The majority of the GMOS-S data was taken using the 
EEV detectors between February and April 2014, though 
one field (J140445—013021.8) was imaged in June 2014 with 
the new Hamamatsu GCDs. GMOS-S covers an area of 
5.5 X 5.5arcmin^ with a pixel scale of 0.146arcsecpixel“^. 
Exposure times were adapted to take varying seeing into ac¬ 
count, in order to obtain a consistent depth across all fields 
(see Table [^. 


2.1.2 Data reduction 

The i' band images were reduced using the publicly available 
THELI software ( jErben et al.|[200^ |Schirmer||2013| ). The 
data were de-biased and flathelded using a superflat created 
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from median-combining all images taken on the same night. 
A fringing model was created and subtracted and the sky 
background was subtracted. The GMOS-S data taken with 
the old EEV detectors exhibited bad fringing (typically ~ 
67% of the background), whereas the data taken with the 
Hamamatsu CCDs has significantly reduced fringing effects 
(< 1% of the background). 

Within THELI, astrometric solutions for the images 
were derived using SCAMP (Bertin 20061 with catalogues 
from the Sloan Digital Sky Survey (SDSS), or the Two Mi¬ 
cron All Sky Survey (2MASS) where fields were not covered 
by the SDSS. Each field was mean-combined using SWarp 
I Bertin et al. 20021 within the THELI software. The reduced 
images were flux calibrated by comparing unsaturated stars 
in each field to the SDSS where possible. Two WHT fields 
were not covered by SDSS and were flux calibrated with un¬ 
saturated stars in SDSS-covered exposures taken immedi¬ 
ately before and after the observations. The GMOS-S fields 
not covered by SDSS were flux calibrated using standard star 
observations. We compared the different calibration meth¬ 
ods for fields with both SDSS coverage and standard star 
observations and find that calibrations with standard stars 
differ from calibrations with SDSS by ^ 0.05 mag. Details of 
the i' band data for each of the 37 CARLA fields are given 
in Table [T] 


2.1.3 Source extraction 


Source extraction was performed using SExtractor (Bertin 
fc Arnouts|1996 1 in dual image mode, using the 4.5 fim im¬ 


ages for detection and performing photometry on the 3.6 fim 
images. IRAC fluxes were measured in 4arcsec apertures 
and corrected to total fluxes using aperture corrections of 
1.42 and 1.45 for 3.6 /rm and 4.5/rm respectively (|Wylezalek| 
et al.|2013 l. At the RA and DEC of each source detected at 
4.5/rm, i'-band fluxes were measured using the IDL APER 
routine. These fluxes were measured in either 2.5arcsec or 
3.2 arcsec diameter apertures, depending on the full-width 
half-maximum (FWHM) of the image. The aperture sizes 
were chosen to be ~ 2.5x the seeing and were a compro¬ 
mise between including as much flux as possible, and avoid¬ 
ing blending. The larger aperture was used for images with 
seeing > 1.15 arcsec (see Table[^. These i' fluxes were then 
corrected to total flux using correction factors (typically 1.15 
and 1.04 for ACAM and GMOS-S data respectively) mea¬ 
sured from the growth curves of unsaturated stars in the 
images. 

Image depths, shown in Table were calculated by 
measuring the flux in ~ 100, 000 random apertures, with 
the aperture size dependent on the FWHM of each im¬ 
age (as above). The median Icr depth of the WHT data 
is i' = 26.79 mag, and for the Gemini data is i' = 26.75 mag. 
Due to the similarity in depth of all the fields, the overall 
median depth of i' = 26.76 mag (= lamed) was used for all 
fields. 

Golours were calculated from aperture-corrected i!— 
aperture-corrected 3.6 pm magnitudes. The colours derived 
from the control field (see Section |2.2[ ) were calculated in 
exactly the same way as the (proto)cluster sample. The dis¬ 
tribution of colours of all galaxies in the (proto)cluster fields 
are consistent with the control field so there are no system¬ 
atic errors in our colour measurements. 


2.2 Control field: UDS and SpUDS 

We utilise the UKIDSS Ultra Deep Survey (UDS; PI O. 
Almaini) to statistically subtract contamination from fore- 
and background galaxies in the cluster fields. The UDS is 
a deep 0.8 deg^ near-infrared survey, which overlaps part of 


the Subaru/AMM-Ae«)ton Deep Survey (SXDS; Furusawa 
et al.|2008| ). 

Galaxies in the UDS have photometric redshift informa¬ 


tion derived in Hartley et al. (20131 using 11 photometric 


bands. The A-band selected catalogue incorporates [/-band 
data from the Canada-France-Hawaii Telescope (Foucaud et 
al., in preparation), optical BVRi'z' photometry from the 
SXDS, JHK photometry from the 8th data release (DR8) 
of the UDS, and Spitzer Ultra Deep Survey 3.6 and 4.5 pm 
data (SpUDS; PI J. Dunlop). Photometric redshifts were de¬ 
termined by fitting spectral energy distribution (SED) tem¬ 
plates to the photometric data points. The resulting disper¬ 
sion is A2/(1 -|- z)= 0.031. 

The SpUDS survey is a 1 deg^ Cycle 4 Spitzer Legacy 
program that encompasses the UDS field. We use the SpUDS 
3.6 and 4.5 pm catalogues of Wylezalek et al. ( |2013 1, which 
were extracted from the public mosaics in the same way 
as for the CARLA survey. Catalogues were created using 
SExtractor in dual-image mode, using the 4.5 pm image as 
the detection image. The SpUDS data reach 30" depths of 
~ 24 mag at both 3.6 and 4.5 pm, but here we use only 
sources down to the shallower depth of the CARLA data. 

The i' band data covering the 0.8 deg^ UDS was ob¬ 
tained as part of the SXDS and resampled and registered 


onto the UDS A-band image (see Cirasuolo et al. 20101. 


The 5a limiting depth is i' = 27.2 mag, but we use only 
sources down to the shallower depth of the CARLA fields. 
Any source with i' magnitude fainter than the median depth 
is limited to lamed. Fluxes were measured in the same way 
as for the CARLA sample, using positions from the 4.5 pm 
catalogue and measuring fluxes using the IDL APER routine 
with 2.5 arcsec diameter apertures. 


2.3 Selection of high redshift galaxies 

The cluster membership of the galaxies in the CARLA fields 
is not yet known, therefore we determine the average i' —[3.6] 
colours of the cluster members by statistically subtracting 
the back- and foreground galaxies. Statistical subtraction 
is most accurate when the cluster members are the domi¬ 
nant population in the sample; however, for the high redshift 
CARLA clusters, the number of interloping galaxies out¬ 
weighs the number of cluster members. We therefore make 
a spatial cut and two colour cuts to pre-select galaxies that 
are most likely to be cluster members. 


2.3.1 Spatial selection 

To select the highest fraction of (proto)cluster galaxies to 
interlopers, we only consider sources within 1 arcmin of the 
central RLAGN for each CARLA field. [Wylezalek et ah] 
( |2013| ) showed that the galaxy density was highest within 
this region. At these redshifts, 1 arcmin is ~ 500 kpc in phys¬ 
ical coordinates. In co-moving coordinates this corresponds 
to ~ 1.8 Mpc radius at « ~ 3 and ~ 1.2 Mpc radius at 
2 = 1.3. This decreasing co-moving radius with time traces 
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the expected collapse of the cluster. Although protoclusters 
always collapse in the co-moving reference frame, in physical 
and angular coordinates the size of the protocluster remains 
approximately constant across 1.3 < z < 3.2 as gravity is 
almost balanced by the Hubble expansion (for a full explana¬ 
tion see Muldrew, Hatch & Cooke 20151. Whilst our 1 arcmin 
aperture can only capture a fraction of these protoclusters, 
it encloses approximately the same fraction at all redshifts 
between 2 = 1.3 and 2 = 3.2. Assuming the CARLA clus¬ 
ters all have approximately the same 2 = 0 mass (see Section 
3.1.2||, this means that we expect to select the same fraction 


of the (proto) clusters at each redshift. 


2.3.2 Colour cuts at z > 1.3 


Without spectroscopic measurements we cannot ascertain 
true cluster membership; however, with the available pho¬ 
tometry we can remove low redshift foreground contami¬ 
nants. The Spitzer IRAC colour cut [3.6] — [4.5] > —0.1 
| |Papovich||2008] ) was used to select sources at 1.3 < 2 < 
3.2. Hereafter we refer to these mid-infrared colour-selected 
sources as “IRAC-selected sources”. This colour cut effec¬ 
tively selects galaxies at 2 > 1.3, with only 10-20% con¬ 


tamination from foreground sources (Muzzin et al. 20131. 


Potential contaminants include strongly star-forming galax¬ 
ies at 0.2 < 2 < 0.5 and powerful AGN at all redshifts. 
To remove these bright interlopers, as well as other bright 
foreground sources, we apply a further cut of i' — [3.6] > 
—0.5 X ]3.6] -I- 11.4, shown in Figure]^ by the red line. This 
line was derived from UDS data, using the photometric red¬ 
shift information to determine where foreground contami¬ 
nants are most likely to lie in colour-magnitude space. From 
the UDS, contours of the probability of a source lying at 
2 < 1.3 were derived. This cut is a linear ht to the con¬ 
tour corresponding to a ^ 80% likelihood of a source lying 
at 2 < 1.3. This cut removes the brightest foreground con¬ 
taminants while retaining 99% of the IRAC-selected sources, 
likely to lie at 2 > 1.3. Throughout the rest of this paper 
we refer to the IRAC-selected sources which have colours 
above these cuts as our “high redshift sample”. After ap¬ 
plying these colour cuts, the CARLA fields are 1.5-2 times 
the density of the average field. Given the fact that we are 
observing a very deep cylinder (from 2 ~ 1.3), and a typical 
protocluster is at most ~ 40 Mpc (co-moving diameter) deep 
I Muldrew, Hatch fc Cooke||2015] ), an overdensity level of 2 
times the held is quite extreme, meaning these structures 
are highly likely to be forming clusters. 


2.3.3 Mass cuts 

As the CARLA survey covers a wide redshift range, from 
2 Gyr to 7 Gyr after the Big Bang, we are likely to de¬ 
tect more low-mass galaxies at low redshifts than at high 
redshifts. This may bias our results by giving a bluer av¬ 
erage colour at low redshift simply because we can probe 
further down the mass function than at high redshift. In 
order to better compare these clusters across redshift, we 
select galaxies with stellar masses of M* > 10^°'® Mq at 



18 19 20 21 22 

[3.6] 


Figure 1. Distribution of UDS sources by redshift in colour- 
magnitude space. Each pixel’s colour represents the mean redshift 
of all the sources within that pixel. We employ a colour-magnitude 
cut, shown by the red line, to remove bright foreground contam¬ 
inants, whilst retaining 99% of our IRAC-selected sources. 


each redshiflQ We approximate stellar mass using the [3.6] 
magnitude and i! — [3.6] colour as a mass proxy. For red¬ 
shifts 0.1 < 2 < 3.2 in steps of A 2 = 0.1, a line in the 
(i! — [3.6])-[3.6] plane was determined for a 10^'^ ® Mq galaxy 
using [Bruzual fc Chariot (20031 models. We used stellar pop¬ 
ulation models with exponentially declining star formation 
models following SFR oc with r of 0.01, 0.1, 0.5, 1 

and 10 Gyr. For all models we assume solar metallicity and 
that the stars are formed with the initial mass function of 
Chabrier (2003). We obtain SEDs of these models at a vari¬ 
ety of ages since the onset of star formation, ranging between 
0.5 and 12 Gyrs in 0.5 Gyr steps (but not allowing the model 
to be older than the age of the Universe). The best-fit line 
to these models then formed the mass cuts for each redshift 
bin. These mass cuts were applied to each (proto)cluster, 
according to its redshift, and are shown as dotted lines in 
Figure Al For clusters with 2 < 1.3, an evolving [3.6] magni¬ 
tude limit was also applied, to avoid faint, low mass galaxies 
entering the sample. This magnitude limit was calculated, 
using the same models as above, as the faintest possible mag¬ 
nitude that a 10^°'® Mq galaxy could have at each redshift. 


^ Note that taking a 10^®'® Mq mass cut does not fully account 
for progenitor bias | |Mundy, Conselice fc Ownsworth|2015| . The 
lower mass progenitors of M* = 10^'^“” Mq galaxies at low redshift 
will not be selected by our mass cut at high redshift. This means 
that there will be additional galaxies that enter the sample at low 
redshift that are not detected at high redshift. These low mass 
objects will typically have bluer colours, which may cause our 
measured colours to be progressively bluer at lower redshifts. An 
evolving mass cut, or a constant number density selection would 
provide a more accurate measure of the evolution of the (i' — [3.6]) 
colour, however these cuts are dependent on the galaxy evolution 
model adopted and are beyond the capabilities of the current 
data. 
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Figure 2. Left: i' — [3.6] colour-magnitucie diagrams for J105231.82+08060 at 2 = 1.64 and J101827.85+05303 at 2 = 1.94, two of 
the 37 CARLA fields. J105231.82+08060 appears to show a dense core of red galaxies, perhaps indicating that the red sequence is 
being populated, whereas the red sources in J101827.85+05303 are more spread out across the field. Red squares show sources within 
1 arcmin of the RLAGN. Grey plus symbols show those sources lying further than 1 arcmin from the RLAGN, which are more likely to 
be contaminants. The blue dashed lines show our Icr i' median depth. Sources fainter than the Icr median depth are set to IcTrned 
shown as lower limits. The dotted blue lines show a M* > 10^*^'® Mq mass cut and the dash-triple dotted blue lines show the cut used to 
remove low redshift contaminants. The RLAGN in these fields are too bright to fit on this scale and are not shown. No sources have been 
removed through statistical subtraction here. The colour-magnitude diagrams of the other 35 CARLA targets are shown in Appendix 
[A| Right: i',[3.6],[4.5] three-colour images of J105231.82+08060 and J101827.85+05303, showing the larcmin radius apertures, within 
which cluster galaxies are selected. The central RLAGN is marked with a green square. 


2.4 Colours of (proto)cluster galaxies 


In Figure we show the colour-magnitude diagrams of 
two of the CARLA fields, J105231.82+08060 (imaged with 
GMOS-S) and J101827.85+05303 (imaged with ACAM) at 
2 = 1.64 and 2 = 1.94, along with their i' , [3.6], [4.5] three- 
colour images. Red squares show sources within 1 arcmin of 
the RLAGN, and grey plus symbols show those sources ly¬ 
ing further than 1 arcmin from the RLAGN, which are likely 
to contain a higher fraction of field contaminants. The blue 
dashed lines show our Icr i' median depth. Due to the depth 
of our i' data we cannot probe the faint red population, 
although at these redshifts the red sequence is depleted at 
faint magnitudes (e.g. [Papovich et al.||2010 l. The faint red 
sources shown as limits are also likely to be cluster members. 
The dotted blue lines in Figure!^ show the mass cut used 
to select galaxies with M» > 10^'^ M©, as described in Sec¬ 


tion The colour-magnitude diagrams of the remaining 
CARLA clusters are shown in Appendix [X] 

We measure the median i' — [3.6] colour of the CARLA 
cluster galaxies (used in Section]^ by dividing the colour- 
magnitude diagram of the clusters into grid cells (see Figure 
and statistically subtracting the expected number of field 
galaxies in each grid cell before taking the median colour of 
the remaining galaxies. The full method is as follows: 

We use the UDS data to derive the average number 
of sources expected from field contamination, and their ex¬ 
pected distribution in i' — [3.6] vs. [3.6] colour-magnitude 
space. Colour-magnitude diagrams were calculated for 401 
randomly located larcmin radius regions in the UDS. The 
colour-magnitude diagrams of the 401 random UDS regions 
were then divided into twelve grid cells (Figure]^ and the 
mean number of sources in each cell was measured (mSi^)- 
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The errors on these numbers were taken as the standard 
deviation of the number of sources per cell. These were 
normalised such that X]so that overall 
Utotai i were used, although distributed according to 

the total populatior0 All these values were calculated after 
applying the appropriate mass cut for the CARLA field be¬ 
ing investigated, so the field and cluster were treated in the 
same way throughout. 

In order to statistically remove field contaminants, in 
each (proto)cluster field randomly-selected sources were re¬ 
moved from each cell before calculating the median i' — [3.6] 
colour of the remaining sources. The number of randomly- 
selected sources removed each time was taken from a Gaus¬ 
sian centred on uflif with a width of cr™®, helping to deal 
with sample variance in interloper galaxies. This was re¬ 
peated for 1001 iterations to give an overall median colour. 
The mean colours were calculated in the same way, though 
sources with i' band magnitudes fainter than 1 (Jmed were 
set equal to 1 (Jmed value (shown as lower limits in Figure 
[^. The mean i' — [3.6] colours typically differ by ~ 0.13 mag 
compared with the median colours and at most differ by 
~ 0.34 mag. The median colours are typically redder than 
the means, with seven exceptions (> 80% are redder). We 
use the median colours hereafter in order to avoid biasing 
our results, although we emphasise that there is good agree¬ 
ment between the mean and median colours. 

The median and mean [3.6] —[4.5] colours were measured 
similarly, dividing the ([3.6] — [4.5])-[3.6] colour-magnitude 
diagrams into cells and statistically removing held contami¬ 
nation. The median [3.6] — [4.5] colours are used throughout. 



18 19 20 21 22 23 24 

[3.6] 


Figure 3. Example of the grid cells used in calculating the av¬ 
erage i' — [3.6] colour of each cluster. In each cell, FfBi ^ 
sources from each cluster field were removed iteratively before cal¬ 
culating the average colour, in order to statistically remove the 
field sources in each CARLA (proto)cluster candidate region. This 
grid is as fine as possible, whilst still ensuring a significant num¬ 
ber of sources in each cell. To guide the eye, shading shows the 
normalised number density distribution of IRAC-selected UDS 
sources. The depth of the 3.6 /rm images is shown by the dotted 
line. Also overlaid are red points showing the colour-magnitude 
diagram of J213638.58-1-00415 at z = 1.94. See Appendix [A] for 
full details of the colour-magnitude diagrams. 


2.5 Low & intermediate redshift cluster sample 


In order to compare the results of the high redshift CARLA 
clusters in this paper to lower redshift clusters, we use pho¬ 
tometric catalogues from the IRAC Shallow Cluster Survey 
(ISCS; [Eisenhardt et al. 20081, covering the Bootes region 
of the NOAO Deep Wide-Field Survey (NDWFS; Jannuzi & 
Dey|1999 1. In Eisenhardt et al. ( 2008| 335 cluster and group 
candidates were identified spanning 0 < z < 2. These form 
a low and intermediate redshift cluster comparison sample. 
We also include in our sample two higher redshift clusters 
discovered in the same sky region by the IRAC Distant Clus¬ 
ter survey (IDCS; the deeper IRAC extension of the ISCS), 
at z = 1.75 ([Stanford et al.|2012 Brodwin et al.|2012| Gon¬ 


zalez et al. |2012|) and z = 1.89 ||Zeimann et al. |2012p. 

The NDWFS Johnson I magnitudes (Eisenhardt et al. 


20081 were converted to SDSS i' magnitudes using the R — I 


colours: 


i'= 0.004 + 0A6{R-I) + I (1) 

This conversion was derived using [Bruzual &; Chariot] ( |2003| ) 
models with both exponentially declining star formation 
models following a star formation rate (SFR) oc with 

T of 0.01, 0.1, 0.5, 1 Gyr and simple stellar population mod¬ 
els where stars form in a single burst at high redshift and 
passively evolve thereafter. A linear equation was then fit to 
the model galaxy {R — I) and {i' — I) colours. 


^ (tFP®, was measured from the number distribution of all sources 

total 

in the 401 random regions, before applying the grid. 


For the low and intermediate redshift sample, galaxies 
were selected if they reside within 1 arcmin of the cluster 
centre and have a Spitzer IRAC colour of [3.6] — [4.5] 0 

for clusters with 0 < z < 1.3, thus removing contaminants 
at higher redshifts. 

Selecting galaxies within a constant 1 arcmin radius of 
the cluster centre at z < 1.3 corresponds to an increasingly 
smaller fraction of the (proto)cluster towards lower redshift 
( Muldrew, Hatch fc Cooke||201^ . This effect is small (at 
most an 8% decrease in the area observed between z = 1.3 
and z = 0.5 Muldrew, Hatch & Cooke 20151, however it 
may bias our selection towards the very core of the lowest 
redshift clusters, and potentially bias our colours to those of 
the most massive cluster galaxies, with the reddest colours, 
due to the SFR-density relation. This effect is unlikely to 
cause a bias in our results for two reasons: first, we are only 
selecting the most massive cluster galaxies in each cluster, 
which are likely to be in the central cluster regions anyway. 
Secondly, Eisenhardt et al. {20081 used a constant physical 
radius for their cluster galaxy selection. This would have 
the opposite effect: selecting a larger fraction of the lower 
redshift clusters. Our results for the ISCS clusters agree with 
the results found in Eisenhardt et al. (20081 and thus are 
unlikely to be biased by our choice of aperture size. 

Clusters at z > 1.3 were treated in exactly the same way 
as the 37 CARLA clusters, as described above. Mass cuts of 
M* > 10'^°'® Mq were taken for all clusters, as described in 
Section 12.3.31 
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Figure 4. Average i' — [3.6] colours for the ISCS clusters, mea¬ 
sured in the same way as for the 37 CARLA clusters. Overlaid 
is a “red spike” model (solid line), where stars are formed in a 
0.1 Gyr burst beginning at = 3, as in |Eisenhardt et al.|(|2008|. 
Also shown are simple stellar population (SSP) models, where 
stars form in a delta burst at Zf = 2, 3, 6 (dashed lines). There 
is very little difference in the resulting colours of the red spike 
model and a SSP with Zf = 3. The measured colours agree with 
a formation redshift for these cluster galaxies o{ Zf ~ 3, in agree¬ 
ment with |Eisenhardt et al.| ( |2008| l, however note the degeneracies 
between the different models at ^ < 1. Points overlaid with grey 
crosses show low redshift clusters whose colours are affected by 
secondary overdensities at higher redshift, which have colours of 
i' - [3.6] ~ 4. 


2.5.1 Testing the method 

The average i' — [3.6] colours for the ISCS clusters are shown 
in Figure]^ Although no cluster membership information is 
used in this stndy, the average cluster galaxy colours agree 
well with those found in Eisenhardt et al^ ( |2008[ ) who used 
photometric and spectroscopic redshifts to determine cluster 
membership. The trend of increasing i' — [3.6] colour with 
redshift agrees with a formation redshift for these cluster 
galaxies of ~ 3, showing larger scatter in the colours at 


higher redshift, as found in Eisenhardt et al. (20081. This 


proves that the statistical subtraction method used in this 
paper to measure average cluster galaxy colours can repli¬ 
cate the results found when cluster membership information 
is taken into account. Four clusters lie significantly off the 
Zf = ?> trend at z ~ 0.4 (shown with light grey crosses in 
Figure]^. The colour-magnitude diagrams for these clusters 
were visually inspected and found to show secondary struc¬ 
tures at higher redshift, which have colours of i! — [3.6] ~ 4. 
Since we cannot separate out and remove these potential 
higher redshift clusters using our method, we instead re¬ 
move these four clusters from the ISCS sample. At 2 < 1 
the ISCS data lies slightly above the model. This is due to 
the mass cut we employ, which selects just the most massive 
galaxies in order to be consistent with the higher redshift 
data. This slight offset is expected from [Eisenhardt et ah] 
(20081, where the most luminous (massive) cluster galaxies 
were systematically redder than simple stellar population 
models due to the mass-metallicity relation. 


3 RESULTS 


The median i' — [3.6] and [3.6] — [4.5] colours of the CARLA 
clusters are plotted as a function of redshift in Figure[^ For 
each field, the redshift of the central RLAGN is used as the 
cluster redshift. We also plot the colours for two spectroscop¬ 
ically confirmed clusters, for comparison: CLG 0218.3—0510 


at 2 = 1.62 (Tanaka, Finoguenov & Ueda 2010 Papovich 
et al.jpolo I and the protocluster around the Spiderweb ra¬ 


dio galaxy, PKS 1138—262 at 2 = 2.156 (Pentericci et al. 


20001). Their colours were measured as described in Section 
2.4| In the bottom panels of Figurej^we plot the character¬ 


istic [4.5] magnitude, m] )^ 5 ], m easured by Wylezalek et al. 
( 2014|) . [Wylezalek et al. (|2014[) studied the luminosity tunc- 


tions of CARLA clusters within three density bins. Since 
most of the CARLA clusters in our present study are more 
than 4 ct denser than the average held, we use the m *4 5 ] de¬ 
rived for the highest density bin used in the [Wylezalek et al.[ 
(2014) study. 

To determine the galaxy formation history of the clus¬ 
ters we compare the average i' — [3.6], [3.6] — [4.5] colours 
and rn *4 5 ] values to three simplistic models (see Figures: 
simple burst models (SSP; Section 113 , exponentially oe- 
clining models (CSP; Section 3.2 1 and multiple burst mod¬ 


els (mSSP; Section 3.31. We generate model galaxies using 
the publicly available model calculator, EzG al ([Mancone] 
with Bruzual & Gharlot (20031 mod- 


match the observed m* of galaxy clus¬ 
ters at 2 ~ 0.82, [4.5] = 19.82 (AB) ( [Mancone et al.[|2012p . 
The scatter in the average [3.6] — [4.5] colours is very large, 
~ 0.2 mag, meaning it is difficult to constrain a formation 
history using these colours. They are consistent with all the 
models we examine in the following Sections and are not 
discussed further. 


3.1 Simple Stellar Population model 

3.1.1 Model description 

The hrst model we examine is a single simple stellar popu¬ 
lation (SSP), where galaxies form in an instantaneous burst 
(hereafter referred to as a delta burst) at 2 / and passively 
evolve thereafter (see the top panel of Figure |^. Such a 
model is commonly used in the literature to estimate the 
formation epoch of cluster galaxies and provides a good ht 
to the 2 < 1.5 data (see Figure]^. 

In the left hand column of Figurej^we compare the SSP 
models with a range of formation redshifts to the average i' — 
[3.6] colours of the GARLA clusters, as well as the average 
[3.6] — [4.5] colour and the characteristic magnitudes m *4 gj. 
Whereas the SSP model with 2 / = 3 agrees with the m *4 gj 
values well at all redshifts, no SSP with a single formation 
redshift is able to match the i! — [3.6] colour data. For the 
CARLA clusters at 2 < 2.5 a formation redshift of 2 / ~ 2.5- 
3 seems to fit the average i! — [3.6] colours well. For clusters 
at higher redshifts, however, the measured average colour 
seems to imply a higher formation redshift. This means that, 
either the basic SSP model is a poor representation of the 


® We have also tested our models usi ng[Maraston| ([2005^ models 
(see Appendix 0 but find that the [Bruzual &: Charlot| l |2003| | 
models provide a better fit to our data. 
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Figure 5. Left: SSP models. Centre: CSP models: exponentially declining models with an e-folding time of 1 Gyr. Right: mSSP models: 
multiple, normally distributed, bursts of star formation with Zp^ak = and FWHM = 1 Gyr. Each column shows the predicted i' — [3.6], 
[3.6] — [4.5] and gj for zj [zp for mSSP) = 2.0, 2.5, 3.0,4.0, 5.0. Overlaid are the data points from the CARLA clusters (black squares), 
as well as two non-CARLA (proto)clusters from previous studies (red crosses). Confirmed (proto)clusters are circled in dark blue. Top: 
The i' — [3.6] colours. The points show the median and errorbars represent the standard deviation of the median from 1001 iterations plus 
a 10% error in the fluxes. Middle: The [3.6] — [4.5] colours. Points show the median source colour and the error on the median. A 20% 
error on the flux measurements is not shown, but would add 0.3 mag to the error bars. Bottom: The data points in the highest 

density bin from |Wyle 2 alek et al.| | [2014[ |. In the top two panels, grey-blue points show those (proto)clusters which lie significantly off the 
trend found in Figure [7| 


galaxy formation history of clusters above « ~ 1.3; or the 
CARLA clusters selected at 2 ~ 3 formed earlier than those 
at 2 ~ 1.5, and thus they do not all lie on one evolutionary 
sequence. We explore which of these scenarios is likely to be 
the case in the following Section. 


3.1.2 Are CARLA clusters an evolutionary sequence? 

The CARLA survey imaged over 400 high redshift RLAGN 
across the entire sky. From this survey, over 230 RLAGN 
appear to be located in regions denser than 2a above av¬ 
erage ( [Wylezalek et ar]|2013[ ). Finally, from these ~ 230 
(proto)cluster candidates we selected the 37 most overdense 
candidates in every redshift bin (see the grey points in the 
left hand panel of Figure]^. Therefore, through our selec¬ 
tion method, we have isolated some of the most overdense 


(proto)clusters across 1.3 ^ 2 ^ 3.2. According to hierarchi¬ 
cal structure formation, the most overdense regions at 2 = 3 
should evolve into the most overdense regions at 2 = 1.3 
(and subsequently 2 = 0), albeit with a large amount of 
scatter ( Ghiang, Qverzier fc Gebhardt||2013 |. So we expect 
the GARLA (proto)clusters in this study to form an approx¬ 
imate evolutionary sequence, with the high redshift proto¬ 
clusters being the statistical ancestors of the lower redshift 
clusters in our sample. 


To test this hypothesis, we examine the galaxy growth 
within the GARLA clusters. With the red crosses in Fig¬ 
ure]^ we show the overdensity (left panel) and abundance 
(right panel) of M, > 10^°'® Mq galaxies within 1 Mpc of 
the RLAGN. These red points show a trend of increasing 
overdensity and abundance towards lower redshift, with a 
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Spearman’s rank correlation coefficient of r = —0.72. This 
is a highly significant trend with a p-value of 5.5 x 10“^. 

This trend was not artificially introduced by our mass 
cuts, which evolve with redshift. We tested this by randomly 
reassigning the redshifts to our 37 CARLA clusters, which 
in turn randomised the mass cut line taken for each clus¬ 
ter. This was iterated 1000 times and the trend of increas¬ 
ing overdensity with redshift re-examined. The randomized 
samples produce a very weak correlation of median value 
r = —0.29, which is not significant (median p= 0.08). There¬ 
fore, randomising the redshifts (and therefore the mass cut) 
for each cluster does not produce a significant trend with 
redshift. 

Furthermore, this trend is not due to massive galax¬ 
ies entering the cluster from the outskirts region, because 
our 1 arcmin apertures contain the same fraction of the 
(proto)clusters at all redshifts between z = 3.2 and 2 = 1.3. 


Chiang, Overzier & Gebhardt (20131 show that most of 


cluster collapse occurs at 2 ; > 1 when viewed in the co¬ 
moving reference frame. However, in physical units, the 
cluster’s effective radius stays relatively stable until 2^1 
because gravity is almost balanced by the Hubble expan¬ 
sion ( Muldrew, Hatch fc Cool^|2015 1. Our 1 arcmin radius 
(~ 0.5 Mpc physical) apertures track the same fraction of 
the (proto)clusters across the 1.3 < 2 < 3.2 epoch. Thus, 
the trend in Figure]^ is not caused by cluster collapse, but 
rather is due to galaxy growth within the (proto)clusters. 

In hierarchical cosmology it takes time for massive 
galaxies to assemble, therefore we use the abundance of 
massive galaxies as a proxy for cluster maturity. The in¬ 
crease in massive galaxy abundance therefore suggests an 
increase in cluster maturity. To test this hypothesis we com¬ 
pare the trend in Figure to the expected galaxy growth 
within semi-analytic models. We use the Guo et al. (20111 
semi-analytic model built upon the Millennium Dark Matter 
Simulation ( [Springel et al.|[2005p . A full description of the 
models and identification of (proto)cluster members is pro¬ 


vided in Cooke et al. (20141 and Muldrew, Hatch & Cooke 
( [M^ . In brief, we devolve 1938 clusters with 2 = 0 halo 
masses of > lO^'^ h~^ Mq back in time and trace their mem¬ 
ber galaxies. At each output redshift we count the number 
of progenitor galaxies with M* > 10^°'® h~^ Mq. The solid 
black line in Figure shows the evolution of the number of 
M* > 10^°'® Mq (proto)cluster galaxies from these models, 
normalised to the least squares fit to the data. Although the 
detailed physics of the semi-analytic models is uncertain, 
the general trend is in good agreement with the data. This 
provides compelling evidence that the growth in abundance 
of massive galaxies within these CARLA clusters suggests 
that they are likely to form an evolutionary sequence: the 
high redshift protoclusters could be the statistical ancestors 
of the lower redshift clusters in this sample. 


3.1.3 SSPs cannot explain high redshift colours 

We have shown that the increase in abundance of massive 
galaxies within the CARLA clusters follows the expected 
trend of galaxy growth within forming clusters. We therefore 
suggest that these CARLA clusters lie on an approximate 
evolutionary sequence, i.e. the lower redshift clusters have 
the expected properties of the descendants of the higher red¬ 
shift protoclusters in our sample. Therefore the colour data 


in Figure must be fit by a single formation model. How¬ 
ever, although a single SSP of any 2 / > 2 fits the colours of 
2 < 1.5 cluster galaxies, at high redshift, we cannot fit one 
formation epoch to all the data across 1.3 < 2 < 3.2. This 
implies that cluster galaxies did not form concurrently at 
high redshift, but rather a more complex formation history 
is required. 

We also note that the majority of these CARLA clusters 
are to be richer than the high redshift ISCS clusters. This is 
discussed further in Section [4.51 


3.2 Composite Stellar Population model 

3.8.1 Model description 

In order to try to fit the unevolving colour in the data, 
we next examine a composite stellar population (CSP), 
where each of the galaxies undergo an exponentially decay¬ 
ing SFR {SFR oc exp —tlr) starting at 2 /, with an e-folding 
timescale r. This is represented by the upper-middle panel 
of Figurej^ CSP models were examined with r =0.1, 1 and 
10 Gyr. All galaxies are assumed to have formed concur¬ 
rently. The short e-folding time of r = 0.1 Gyr gives similar 
results to the SSP models, and the r = 10 Gyr models can¬ 
not produce i' — [3.6] colours redder than 1.5. The CSP mod¬ 
els with r = 1 Gyr are shown in the centre column in Figure 

t CSP models with Zf > 2 fit the 5 ] values at low red- 
ift, however they cannot explain the bright magnitudes at 
2 = 2.5. Although this model succeeds in producing a flat¬ 
ter i' — [3.6] colour trend with redshift, the colours are still 
too blue to fit the CARLA data. This means that massive 
cluster galaxies could not have formed their stars gradually 
in one long period of star formation unless there is a large 
amount of dust attenuation. 


3.2.2 Dust extinction 


Dust attenuation in the cluster galaxies will cause their 
colours to appear redder. Adding dust to the CSP mod¬ 
els would make the models redder. Because the models get 
bluer at higher redshift, in order to fit the flat colour trend 
of the data, we require a varying amount of dust extinction 
(Ay) with redshift. Assuming the Calzetti extinction law, 
to match the CSP model^ we would require Ay ~ 1.8 at 
2 = 3, Ay ~ 1.3 at 2 = 2 and Ay ~ 1.1 at 2 = 1.3. This level 


of dust extinction is not extreme for these redshifts ([Cam 
fc Best|2010 Cooke et al.||2014 (. 


A number of recent studies have found large numbers of 
dusty, star-bursting galaxies in high-redshift (proto)clusters 
(e.g. Santos et al. 2014 2015 Dannerbauer et al. 20141. 


These large numbers do not necessarily mean that the dusty 
star-forming population represent the majority of the clus¬ 
ter population. Indeed, despite the increase in star-formation 
Papovich et al. ( 2012|) found that the majority of clus- 


rates. 


ter galaxies in the central regions are passive. 

The colours plotted in Figure]^ are the median values 


^ In order to match the SSP models, we would require a varying 
amount of dust extinction with redshift, with Ay ~ 1.8 at 2 = 3, 
and Ay Si 0.7 at 2 = 1.3. This amount of dust extinction in 
passively evolving galaxies is unlikely, due to the lack of on-going 
star formation. 
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for each cluster. This means that a large fraction of the 
galaxies would need to be dusty in order to affect the over¬ 
all median colour we measure. Up to 10% of UDS sources 
(with all our selection criteria and cuts applied) are de¬ 
tected at 24 fj,m. This suggests that the fraction of galaxies in 
our CARLA sample that are extremely dusty, star-forming 
galaxies is less than 10 %, and therefore are unlikely to affect 
our measured median colour. 

3.2.3 CSP cannot explain cluster colours without dust 

The CSP models shown here do not produce colours which 
are red enough to explain the observed (i' — [3.6]) data. In or¬ 
der to match the data, we require a significant fraction of the 
cluster population to be dusty, highly star-forming galaxies, 
and have an average dust attenuation that increases with 
redshift. Previous studies have shown that a significant frac¬ 
tion of the massive cluster population are likely to be pas¬ 
sive (at least up to 2 ~ 1-6), so it is unlikely that these CSP 
models are correct for this massive cluster galaxy popula¬ 
tion. Furthermore, significant dust extinction would bring 
further discrepancy between the models and the values of 

’^[ 4 . 5 ] ■ 

3.3 Multiple Simple Stellar Populations model 

We have shown that the epoch of massive cluster galaxy 
formation has to be extended, but the CSP models which 
extend the period of star formation cannot fit the data un¬ 
less we incorporate a significant amount of dust. In order 
to produce the observed red {i' — [3.6]) colours at 2 = 3, at 
least some of the cluster population must already be passive 
at high redshift. 

3.3.1 Model description 

In order to produce passive M > 10^°'® M© galaxies by 2 = 
3, we again model galaxy formation as single bursts of star 
formation. To extend the period of cluster galaxy formation, 
and produce an approximately unevolving colour trend with 
redshift, we use multiple simple stellar populations (mSSP), 
where cluster galaxies are formed in individual, short bursts, 
with their formation redshifts distributed in time so that the 
total cluster population forms over the course of a few Gyr. 
This is illustrated in the lower-middle panel of Figure]^ The 
green histogram represents the relative fraction of galaxies 
being formed. 

We create mSSP models with 16 model galaxie^ which 
each form their stars in delta-bursts, with their formation 
redshifts normally distributed in time around a peak at red¬ 
shift Zpeak. The normal distribution’s FWHM is set to 1 Gyr. 
We then take the median colour of these model galaxies at 
each redshift. The right hand column of Figure shows the 
mSSP models for the i' — [3.6], [3.6] — [4.5] and 5 ] val¬ 
ues with different Zpeak. Including multiple bursts of star 
formation flattens the expected i' — [3.6] colour over red¬ 
shift and provides good agreement with the CARLA data. 

® On average there are 16 cluster galaxies within 1 arcmin of the 
RLAGN in each of the 37 CARLA fields, once field contaminants 
are statistically removed. 



Figure 6. Cartoon representation of the three models described 
in Sections |3.H3.3| as well as a model based on the star formation 
rate density of the Universe, described in Section |4.2| Top: SSP 
model: stars form in a single delta burst at 2 y = 3 and passively 
evolve thereafter. Upper middle: CSP model: the galaxy under¬ 
goes an exponentially decaying SFR {SFR oc exp—i/r) starting 
at Zf = 3, with r = 1 Cyr. Lower middle: mSSP model: galaxies 
are formed in multiple bursts of star formation, normally dis¬ 
tributed in time around 2peafc = 3 with FWHM= 1 Cyr. The 
green histogram illustrates the relative fraction of galaxies that 
are formed in each time interval. Bottom: Multiple burst model 
where the distribution of galaxies follows the cosmic star forma¬ 
tion rate density, see Section |4.2| for details. As above, the his¬ 
togram illustrates the relative fraction of galaxies that are formed 
in each time interval. 


The [3.6] — [4.5] and m *4 5 ] values are also consistent with 
Zpeak > 2.5. We have also tested this model with varying 
FWHM and find that the FWHM has to be > 0.9 Gyr in 
order to provide a good fit to the CARLA data. 


3.3.2 mSSP model provides a good description of the data 

The mSSP model describes the observed i' — [3.6] colours of 
these CARLA clusters well, and also agrees with the m *4 
values. We conclude that a more extended period of burst¬ 
like galaxy formation, spanning at least 1 Gyr, is required to 
explain the colours of the CARLA cluster galaxies. We have 
modelled these galaxies as forming in single bursts, but due 
to the scatter in our data we cannot constrain the individual 
star formation histories of the cluster members. The median 
i' — [3.6] colours mean that the galaxies must have ceased 
their star formation rapidly in order to produce red colours. 
This bursty appearance could also be produced with a vari¬ 
ety of star formation histories, so long as the star formation 
is rapidly terminated. Investigating these formation histo¬ 
ries is beyond the scope of these data and we just examine 
the most basic, burst models. 
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Figure 7. Left: 1 + 5 = AfQA.RLA/^UDS fo'' the clusters as a 
function of time. Grey points show the original densities, without 
taking any mass cuts. Red points show the densities after taking a 
IQiO.S mass cut. There is a significant trend of increasing den¬ 
sity over time. Clusters which do not lie on this trend are unlikely 
to be the progenitors/descendants of those on the trend. Right: 
^CARLA ~ ^UDS • This shows how the number of galaxies within 
1 arcmin of the RLAGN with M> 10^°'® Mq increases with time. 
The black line shows the expected evolution of the number of 
massive (proto)cluster galaxies from semi-analytic models. Blue 
points show the ISGS clusters, discussed in Section |4.5| High¬ 
lighted with grey circles are clusters that lie significantly off the 
expected trend for the CARLA clusters and thus are unlikely to 
form part of the evolutionary sequence. 


4 DISCUSSION 

4.1 Clusters undergo extended periods of galaxy 
formation 


We have examined three model star formation histories: a 
single stellar population, an exponentially declining SFR, 
and multiple bursts of star formation distributed normally 
around a peak period at Zp^ak- We find that SSP models 
(left hand column of Figure]^ are unable to account for the 
red i' — [3.6] colours of cluster galaxies at 2 > 2.5 and the flat 
colour trend we find at 2 > 1.3 (assuming that these clusters 
represent one evolutionary sequence; see Section 3.1.21. By 
examining the colours of cluster galaxies at 2 > 1.3 we are 
able to distinguish the cluster formation histories and have 
shown that the epoch of galaxy formation in clusters has to 
be extended; a single formation redshift is not sufficient to 
produce the colour trend we observe. 

We have shown that the {i' — [3.6]) colours of these clus¬ 
ter galaxies agree well with a model in which they formed 
in multiple short bursts over approximately 2 Gyr, peaking 
at 2 ~ 3. This is consistent with a model where different 
populations of galaxies form in individual bursts at different 
times, building up the galaxy population over time, rather 
than in one, short burst. This model is similar to the com¬ 
posite model from Wylezalek et al. (20141 used to explain 
the luminosity functions of CARLA clusters. Although we 
claim that the cluster galaxies formed over an extended pe¬ 
riod of time, our data are not sufficient to further constrain 
the galaxy formation history. A number of extended galaxy 
formation models could fit these data. 


Figure 8. The points are the same as in Figure]^ also showing 
the ISCS clusters at lower redshift as cyan diamonds. Overlaid are 
the mSSP model with multiple bursts of star formation normally 
distributed around Zp^^tk = 3, with FWHM=lGyr (solid line), 
and a multiple-burst model following the cosmic star formation 
history (dashed line). These extended galaxy formation models 
provide the best fit to our CARLA clusters at high redshift, and 
also account for the colours observed at lower redshift. The scatter 
in our data is too large to allow any further distinction between 
the two extended models. 


In order to further analyse the formation history of mas¬ 
sive cluster galaxies, we must adopt a model. Our following 
results do not strongly depend on the exact form of this ex¬ 
tended model, but we choose to follow recent literature (e.g. 
[Snyder et al.|[2C)12[ ) in assuming the cluster galaxies follow 
the star formation rate density trend of the Universe. 

We expand our mSSP model to follow the cosmic 
star formation rate by producing 500 model galaxies, each 
formed in a single short burst, distributed in time according 
to the star formation history of the Universe from |Hopkins1 
|fc Beacom] ( |2006| ): 


(a + bz)h 
1 -I- [z/cY 


( 2 ) 


where p* is the star formation rate density, a = 0.017, 
b — 0.13, c = 3.3, d = 5.3, and h = 0.7. The bottom panel 
of Figure illustrates this multiple-burst model. The red 
dashed line shows the overall shape of the cosmic star forma¬ 
tion rate, the green histogram indicates the relative fraction 
of galaxies that are formed in each time interval. 

In Figure |8 we show this model, as well as the mSSP 
model (Section 3.3[ ), with multiple bursts of star forma¬ 
tion around Zp^ak = 3 (normally distributed bursts across 
~ 2Gyr). This Figure illustrates that we do not have suf¬ 
ficient data to distinguish between different extended mod¬ 
els. Both models can also account for the colours of lower 
redshift clusters, providing a consistent explanation for the 
formation of all massive cluster galaxies. 


4.2 Formation timescale of massive galaxies 

In this Section we examine the period of time between high 
redshift cluster galaxies forming their stars and assembling 
into Mt > 10^°'® Mq objects. In hierarchical merging models 
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(e.g. De Lucia et al.|2006 1 galaxies form in small entities and 
subsequently merge. Therefore there may exist a long time 
delay between the period of star formation and their assem¬ 
bly epoch. If galaxies merge with little gas and no significant 
star formation (a “dry merger”), then the resulting massive 
galaxies will appear red. If the merger included a lot of gas 
(i.e. a “wet merger”, or if the galaxies formed via monolithic 
collapse, Eggen, Lynden-Bell fc Sandage|1962 1, and induced 
further star formation, then the resulting galaxies will have 
bluer colours. Thus we can use our data to estimate the time 
between star formation and assembly into M* > 10 ^°'® Mq 
galaxies. 

To do this we form 500 model galaxies distributed in 
redshift according to the cosmic star formation rate density 
(Equation and calculate an average i' — [3.6] colour at 
each redshift. To simulate galaxies growing in mass through 
dry mergers and entering our sample only after a certain 
period of time, we impose a restriction whereby galaxies are 
only included in our sample after a time delay At. 

Figure shows this model with different values of At. 
The CARLA data at 2 : > 2 are only consistent with a max¬ 
imum time delay of At « 0.5 Gyr. This short delay between 
galaxies forming their stars and growing massive enough to 
enter our sample is in agreement with studies of the luminos¬ 
ity function of galaxies at high redshift, which show that the 
bright end of the luminosity function is established within 
5 Gyr of the Big Bang (e.g.|De Propris et al.|2003 JAiidreon 


2006[ [Muzzin et al.||2008l [Mancone et al.||2010| [Wylezalek 


et al.|20T4 1. 


These results do not depend on the exact form of the 
cluster’s assembly history. We have tested different assem¬ 
bly histories (the best-fit normally distributed model from 
Section 13.31 and a model with the same form as the cosmic 
star formation rate density, but shifted to higher redshifts) 
and found no qualitative difference in these results. Indi¬ 
vidual galaxies must still have assembled within 0.5 Gyr of 
formation of the majority of their stars. 

In summary, 2 > 2 massive (M* > Mq) clus¬ 

ter galaxies must have assembled within 0.5 Gyr of forming 
their stars. This could have happened in a number of differ¬ 
ent ways, such as: formation through a single massive burst; 
merging into massive galaxies soon after they formed their 
stars; undergoing a merging event which triggered a mas¬ 
sive starburst which dominated the observed colours of the 
galaxy thereafter. 

At 2 < 2, there can be a long delay (several Gyr) be¬ 
tween galaxies forming their stars and assembling into mas¬ 
sive galaxies. Dry galaxy merging is likely to become a much 
more important route by which massive galaxies form at 

2 < 2. 


4.3 Cessation of star formation within massive 
cluster galaxies 

Massive galaxies at 2 = 0 are passive and contain old stellar 
populations which suggest that they finished forming stars 
at 2 > 2 (e.g. Bower, Lucey fc Ellis|[l9^ |. The cosmic star 
formation model forms stars up to the present-day. In this 
Section we test whether a cut-off in galaxy formation at 
higher redshifts provides a better fit to the data. 

To test when massive galaxy formation ceased in clus¬ 
ters, we form 500 model galaxies following the cosmic star 



Figure 9. The expected average i' — [3.6] colours of clusters when 
the component galaxies are formed in single bursts, distributed 
according to the star formation rate density of the Universe. The 
points are the same as in Figure The models here include a 
time delay At between galaxies forming and being included in 
our selection. Different lines correspond to different values of At. 
Values of At larger than ~ 0.5 Gyr produce average cluster colours 
that are too red to explain the observations at 2 > 2. 


formation rate density (Equation]^, down to a defined red¬ 
shift Zend, i.e. with no more star formation occuring in mas¬ 
sive galaxies at 2 < Zend. Throughout this Section, we use 
At = 0. Figure 10 shows the average i' — [3.6] colour of the 


model galaxies with various different values of Zend- Higher 
values of Zend predict slightly redder colours at 1 < 2 < 2.5, 
however the scatter in our data does not allow us to quan¬ 
tify whether a termination of star formation at any par¬ 
ticular Zend is required. The reddening of galaxy colours is 
entirely due to the peak epoch of star formation occurring at 
2 ^ 2 ^ 3 and few stars forming in massive galaxies there¬ 
after. To determine when star formation in massive cluster 
galaxies ceased, we require measurements of the individual 
star formation rates of the cluster members. The average 
i' — [3.6] colours alone do not contain enough information. 


4.4 Is an extended galaxy formation history 
consistent with previous work? 

Previous studies have modelled the formation of cluster 
galaxies as a single concurrent event. Lower redshift data 
agree well with these models (e.g. Blakeslee et al.|2003 Mei 


et al.|2009 1. In this Section we test whether an extended pe¬ 


riod of galaxy formation is consistent with the observational 
data from previous work. 

In Figure we examine the trend of the rest-frame 
iU — B)o average colour and scatter predicted from the ex¬ 
tended model from Section[4^ following the cosmic star for¬ 
mation density, with different cut-off redshifts, Zend- Over- 


(Bower, Lucey & Ellis||1992 

Ellis et al.||1997 

van Dokkum 

et al.|1998 

Blakeslee et al.||2003| |Mei et al.||20091 |Papovich 

et al. 1(2010 

Snyder et al.||201 

21 at 0 < 2 < 1.6. 


The (U — B)o colour and scatter depend on the adopted 
value of Zend. Stopping galaxy formation at higher Zend 
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Figure 10. The same as Figure but the different lines here 
correspond to different cut-offs in star formation, i.e. star forma¬ 
tion is allowed to continue down to following the cosmic 

star formation rate density, and then terminated. The choice of 
Zend has little effect on the expected colours, suggesting that the 
cosmic star formation rate density itself is sufficient, requiring no 
cut-off. 


decreases the scatter and reddens the expected {U — B)o 
colours. Our model colours and scatters were calculated tak¬ 
ing the whole cluster population into account, whereas the 
data points were measured from red sequence galaxies only. 
Therefore the data are expected to have a redder colour and 
smaller scatter than the models, however almost all massive 


cluster galaxies at 2 < 1.5 exhibit red colours (e.g. Kaji- 


sawa & Yamada 20061. The models with Zend ~ 1-2 show 


good agreement with these previous results, although there 
is large scatter in the data, suggesting that our simple burst 
model following the cosmic star formation rate density of 
the Universe (with some reasonable Zend ~ 1-2) provides 
a possible explanation for the formation history of massive 
cluster galaxies. 


4.5 Cluster mass 


The luminosity functions of CARLA clusters are signifi¬ 
cantly different from the ISCS clusters, with the CARLA 
clusters having brighter m*^ 5 ] values than those from the 
ISCS (Wylezalek et al. 2014|^. T he lowest density bin ex- 
amined in |Wylezalek et al.| ( |2014'| ) showed results more con¬ 
sistent with the ISCS sample, suggesting that the lower¬ 
richness CARLA clusters are more similar to the clusters 
from ISCS. In this paper we only examine the densest 
CARLA clusters and in Figure we conhrm that members 
of this subset are richer than the 2 > 1.3 ISCS clusters. 

In the right hand panel of Figure we plot the number 
of massive (M* > 10^°'® M©) galaxies in the CARLA fields. 
We also show the excess number of galaxies in the ISCS 
clusters at 2 > 1.3. The majority of the high redshift ISCS 
clusters have systematically fewer massive galaxies than the 
CARLA sample. This indicates that the CARLA clusters 
are more overdense than the ISCS sample and are therefore 
likely to be more massive clusters and protoclusters. 


Brodwin et al. (2007 \ found that the correlation func¬ 


tion of ISCS clusters indicates they reside in dark matter 
haloes of ~ IO'^^'^Mq, and will evolve into clusters of 2 - 
3 X 10^'* Mq by 2 = 0. Figure 0 suggests that the majority 
of the 37 CARLA clusters in this study will collapse to be¬ 
come more massive clusters (> 5 x 10^^ M©) by the present 
day. 


Brodwin et al. (20131 suggest that the formation his¬ 


tory of massive cluster galaxies depends on the overall mass 
of the cluster in which they reside. They found that the 
lower-mass ISCS clusters were undergoing a major epoch of 
merging and galaxy formation peaked around 2 ~ 1.4, and 
predicted that the formation epoch would peak at higher 
redshifts for more massive clusters. Our CARLA data are 
consistent with a peak formation and assembly epoch of 
2 ~ 2-3 for the more massive CARLA clusters, in agree¬ 
ment with a higher assembly epoch for more massive clus¬ 
ters. With the {i' — [3.6]) data, however, we find that the 
cluster density does not affect the measured average colour. 
This is perhaps due to the broad range of wavelengths cov¬ 
ered by the observed i' — [3.6] colour at these redshifts. 

In summary, the CARLA fields studied here are un¬ 
likely to be the progenitors of the lower redshift ISCS clus¬ 
ters. They are likely to evolve into more massive clusters 
5 X IO'^^Mq). However, we do not think this will affect 
the results of Figures [8|10| as both populations exhibit sim¬ 
ilar average i' — [3.6] colours. 


4.6 Caveats 


4-6.1 CARLA sample are not yet confirmed 

Most of the CARLA clusters studied here are not yet spec¬ 
troscopically confirmed, which may affect our conclusions on 
the evolution of clusters. These helds were selected as the 
most dense CARLA fields, which are significantly denser 
than the average field at a > 4(j level, and thus are likely 
to contain protoclusters. Also, in Figure]^ the trend of in¬ 
creasing density towards lower redshifts suggests that most 
of our sample are indeed (proto)clusters. Some fields in our 
sample have clear evidence for a forming red sequence (see 
Figure Al 1 and three of our sample are spectroscopically 
confirmed structures: 7C1756+6520 (Galametz et al.|2010l, 
TXS 1558-003 ([Hayashi et al.|[M2 |, and MRC0943-242 
i Venemans et al.|2007 l. These three clusters, as well as the 
two non-CARLA clusters from Papovich et al. (20101 and 


Pentericci et al. (20001, follow the same flat trend in colour 


as the unconfirmed clusters at all redshifts. This all provides 
strong evidence that the majority of the CARLA fields in 
this study are likely to be (proto)clusters. Further spectro¬ 
scopic studies are required to conhrm this. 


4 . 6.2 AGN 

The presence of AGN may cause redder colours in our clus¬ 
ter sample. The fraction of AGN in clusters is known to be 
enhanced compared to the held (e.g. Galametz et al.|2010 |, 
which may affect the IRAG bands. Our use of median colours 
throughout should prevent small numbers of AGN signih- 
cantly affecting our measured average colours. 
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Figure 11. Predicted evolution of rest-frame {U — B)q colours for model galaxies forming as described in Section [4.2| with different values 
of Left: The average colours for all cluster galaxies. Right: The scatter in the {U — B)o colours for cluster galaxies. Overplotted in 

red are results from previous studies: |Mei et al.| ( |200^ (squares), [Blakeslee et ah] ( |200^ (diamonds), [Papovich et al.|l|2010[| (triangle), 
[Snyder et al.|(|2012 (stars). Three studies at low redshift are shown by the red crosses; these are data taken from [Bower, Lucey fc Ellisj 
([1992[|; [Ellis et al.[ 1997[l;[van Dokkum et ar]|[1998[l, adapted to rest-frame {U — B)o by[Mei et al.|([2009^. 


4-6.3 Blending 

The FWHM of the Spitzer 3.6 and 4.5/rm data is ~ 
1.7 arcsec. This means that source fluxes may be affected by 
blending with nearby sources, particularly in crowded fields. 
The i' data, although having a small FWHM, may also ex¬ 
perience some blending. If blending occurs between galaxies 
at similar redshifts, i.e. between cluster members, our con¬ 
clusions will be unaffected, as we measure median colours 
of clusters throughout. Blending with fore- or background 
sources may cause inaccuracies in the measured colours. Fur¬ 
ther data with better resolution is required to gain more 
accurate measurements of galaxy colours. 

5 CONCLUSIONS 

We have used a sample of 37 clusters and protoclusters 
across 1.3 ^ z ^ 3.2 from the CARLA survey of high- 
redshift clusters to study the formation history of massive 
cluster galaxies. These fields are the densest regions in the 
CARLA survey, and as such are likely to be the sites of for¬ 
mation for massive clusters. We have used optical i'-band 
and infrared 3.6 ^m and 4.5 ^m images to statistically se¬ 
lect sources likely to lie within these (proto) clusters and ex¬ 
amined their average observed i' — [3.6] colours. The abun¬ 
dance of massive galaxies within these (proto)clusters in¬ 
creases with decreasing redshift, suggesting these CARLA 
(proto)clusters form an evolutionary sequence, with the 
lower redshift clusters in the sample having similar prop¬ 
erties to the descendants of the high redshift protoclusters. 
This sequence allows us to study how the properties of their 
galaxy populations evolve as a function of redshift. By com¬ 
paring the abundance of massive galaxies in these CARLA 
(proto)clusters to those of 2 > 1.5 ISCS clusters we have 
shown that the CARLA sample are likely to collapse into 
more massive clusters, typically > 5 x 10^"^ M©. 

We have compared the evolution of the average colour 
of massive cluster galaxies with simple galaxy formation 


models. Taking the full cluster population into account, we 
have shown that cluster galaxies did not all form concur¬ 
rently, but rather formed over the course of a few Gyr. The 
overall colour evolution is consistent with the stars in each 
galaxy forming in a single burst, although more complex 
individual star formation histories that are rapidly trun¬ 
cated may produce this effect. This galaxy formation his¬ 
tory is consistent with galaxies within different groups of the 
(proto) cluster forming concurrently, but the whole cluster 
population building up over a longer period of time. Overall 
this produces an approximately unevolving average observed 
i' — [3.6] colour for cluster galaxies at 2 = 1.3 to 2 ~ 3. 

In summary, our main conclusions are as follows: 

(i) The average colours of massive cluster galaxies are rela¬ 
tively flat across 1.3 < 2 < 3.2. It is not possible to describe 
the formation of these galaxies with a burst model at a single 
formation redshift. Cluster galaxies formed over an extended 
period of time. 

(ii) The formation of the majority of massive cluster galaxies 
is extended over at least 2 Gyr, peaking at 2 ~ 2-3. From 
the average i' — [3.6] colours we cannot determine the star 
formation histories of individual galaxies, but their star for¬ 
mation must have been rapidly terminated to produce the 
observed colours. 

(iii) Massive galaxies at 2 > 2 must have assembled within 
0.5 Gyr of them forming a significant fraction of their stars. 
This means that few massive galaxies in 2 > 2 clusters could 
have formed via dry mergers. 
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APPENDIX A: COLOUR MAGNITUDE 
DIAGRAMS 

The i' — [3.6] vs. [3.6] colour-magnitude diagrams for the re¬ 
maining 35 CARLA fields (not showing Figure]^ are shown 
in Figure |A1[ There is a large scatter in the colours of 
sources, suggesting that each of these clusters still has con¬ 
tinuing star formation. 


APPENDIX B: MARASTON MODELS 


At high redshift (1.4 < 2 < 2.7), the treatment of the asymp¬ 
totic giant branch (AGB) phase of stellar evolution becomes 
important in the Spitzer wavebands. At these redshifts, the 
AGB effect is expected to be at a maximum. [Maraston et ah] 
( |2006[ ) showed that the AGB phase of stellar evolution can 
affect the measured age and mass of high redshift galaxies 
and produce systematically younger ages than [Bruzual fc| 
|Charlot| (|2003[) models. This effect is unlikely to be signifi¬ 


cant, as Kriek et al. (20101 showed that Bruzual & Chariot 


120031 provide better fits to post-starburst galaxy spectral 
energy distributions than |Maraston| ( |2005[ ) models which 
take into account the effects of AGB stars. 


Figure Bl. The same as the right hand colu mn of Figure]^ with 
[Maraston] ( |2005l l mSSP models used instead of jBruzual fc Chariot] 
(|2003|l models. 


We reproduce the mSSP models (Section 3.31 us¬ 


ing 


Maraston (20051 models (Figure Bll. Models with a 


Chabrier (20031 IMF were not available for the Maras¬ 
ton (20051 models so we use a Kroupaj (20011 IMF, which 
produces similar results. Qualitatively the [Maraston] ([2005 1 


models show the same trends as the IBruzual fc Chariot I 
(20031 models for the i' — [3.6] colours. The CARLA IRAC 
colours are better fit by [Maraston (20051 models, however 
the Bruzual & Chariot] (]2003]) models are also consistent 
within scatter in the colours and flux errors. We use lBruzuall 
]fc Charl^ (20031 models in our analysis as the models of the 
[4.5] magnitudes and i' — [3.6] colours give a consistent es¬ 
timate of Zpeak for the CARLA cluster data, whereas the 
[Maraston (20051 models for the [4.5] magnitudes suggest a 
much higher Zpeak than the i' — [3.6] colours. 
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Figure Al. i' — [3.6] colour-magnitude diagrams. Red squares show sources within 1 arcmin of the RLAGN. Grey plus symbols show 
those sources lying further than 1 arcmin from the RLAGN, which are more likely to be contaminants. The blue dashed lines show our 
l(j i' median depth. Sources fainter than the Icr median depth are set to termed shown as lower limits. The dotted blue lines show 
a M* > Mq mass cut and the dash-triple dotted blue lines show the cut used to remove low redshift contaminants. The RLAGN 

is shown by a red star. In some diagrams, the AGN is too bright to fit on the scale and so is not shown. For 7C1756+6520 confirmed 
cluster members from [Galametz et al.| ( [2010| | are shown by green circles. 


© 2015 RAS, MNRAS 000, 



















[9'e]-! [9'e]-! [9'e:]-! [9'e]-! [9'e]-! 


Formation history of massive cluster galaxies 19 



J115043.87-00235 


+ Q. ° j" ^ ' 

^ % □□ .r -H- 






J112338.14+052038.5 

z=2.18 


++ + ,■+ + + + 


4. 


H- + ,P 


, T S 04(4?” 

+-f* + fel-? • 

m ^D® ®+ 

-..Jiy - 




J140445.88-013021.8' 
' "2=2.50 


□ -y 

' • g °* /. 

y “ yz-yoy* tc 

:. - * * : 

^ _ 


■' V 


^ +° 4# + s , * 

• **8*?*.. Vt ift 

•*<o*a* »n A*yXb 

° ‘.tysf ^ 

t 4. 

- - - 4 □ ■ + ■'™*T 


y ■"+ °+++^'^+-*y 

■fj- + . □□ 4. +4- 4. + 

h *' 4 +t °+ 4 ^ 

. ** 4.s= ;**|!i*:* 

■■- .4° * °. ‘.yyyji 



3.6 



3.6 
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Figure Al - continued 
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